(original) A metadata production device, comprising: 
a content reproduction portion that reproduces and outputs content; 
an voice input portion; 

a voice recognition portion that recognizes voice signals that are input from the 
voice input portion; 

a metadata generation portion that converts information recognized by the voice 
recognition portion into metadata; 

an identification information attaching portion that obtains identification 
information for identifying positions within the content from the content and attaches the 
identification information to the metadata; and 

a dictionary that is limited in accordance with the content; 
whereby the generated metadata is associated with positions in the content; and 
the recognition is performed in association with the dictionary, when recognizing 
the voice signals input from the voice input portion with the voice recognition portion. 

2. (cancelled) 

3. (original) The metadata production device according to claim 1, 

wherein the voice signals are recognized by the voice recognition portion word by 
word in association with the dictionary. 

4. (previously presented) The metadata production device according to claim 1, 
further comprising an information processing portion including a keyboard, 

wherein the metadata can be corrected through the information processing portion by 
input from the keyboard. 

5. (previously presented) The metadata production device according to claim 1, 
wherein time code information that is attached to the content is used as the 

identification information. 

6. (previously presented) The metadata production device according to claim 1, 
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wherein content addresses, numbers or frame numbers attached to the content are 
used as the identification information. 

7. (original) The metadata production device according to claim 1, 
wherein the content is still-picture content, and the addresses of the still-picture 

content are used as the identification information. 

8. (currently amended) The metadata production device according to claim 1 , 
wherein 

the content reproduction portion is configured by a content database; 

the voice input portion supplies to the voice recognition portion voice signals of 
entered keywords that have been converted into data with a clock signal that is 
synchronized with a synchronization signal supplied from the content database; 

the voice recognition portion is configured to recognize the keywords from the 
voice signal data that has been converted into data by the voice input portion; 
and 

the metadata generation portion is configured as a file processing portion that 
produces a metadata file by using, as the identification information, a time code that 
indicates a time position of an image signal included in the content, and combining the 
keywords that are output from the voice recognition portion with that time code. 

9. (original) The metadata production device according to claim 8, 
further comprising a recording portion that records the content that is supplied 

from the content database together with the metadata file as a content file. 

10. (currently amended) The metadata production device, according to claim 9, 
further comprising a content information file processing portion that generates a 

control file controlling the relation between the metadata file and recording positions at 
which the content file is to be recorded; 

wherein the control file is recorded in the recording portion together with the 
content file and the metadata file. 
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1 1 . (original) The metadata production device according to claim 8, 

further comprising a dictionary database, wherein the voice recognition portion 
can select a dictionary of a genre corresponding to the content from a plurality of genre- 
dependent dictionaries. 

12. (original) The metadata production device according to claim 1 1 , 
wherein keywords related to the content can be supplied to the voice recognition 

portion; and 

wherein the voice recognition portion is configured to recognize those keywords 
with higher priority. 

13. (original) A method for producing metadata, comprising: voice-inputting 
information related to a given content while displaying the content on a monitor; 
subjecting the input voice signal to voice recognition with a voice recognition device 
using a dictionary that is limited in accordance with the content; converting voice- 
recognized information into metadata; and attaching identification information provided 
to the content for identifying positions in the content to the metadata, thereby associating 
the generated metadata with the positions in the content. 

14. (cancelled) 

15. (cancelled) 

1 6. (original) The method for producing metadata according to claim 13, 
wherein time code information that is attached to the content is used as the 

identification information. 

1 7. (original) The metadata production device according to claim 13, 
wherein the content is still-picture content, and the addresses of the still-picture 

content are used as the identification information. 
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18. (currently amended) A metadata search device, comprising: 
a content database that reproduces and outputs content; 

a voice input portion that converts voice signals of entered keywords into data 
with a clock signal that is synchronized with a synchronization signal of the reproduced 
content; 

a voice recognition portion that recognizes the keywords from the voice signal 
data that has been converted into data by the voice input portion; 

a file processing portion that produces a metadata file by combining the keywords 
output from the voice recognition portion with time codes that indicate a time position of 
an image signal that is included in the content; 

a content information file processing portion that generates a control file 
controlling a relation between the metadata file and recording positions of the content 
file; 

a recording portion that records the content file, the metadata file and the control 
file; and 

a search portion that extracts a recording position corresponding to a keyword in 
the content file by specifying the metadata files in which an entered search keyword is 
included, and referencing the control file; 

wherein the recording position of the content file is the recording position in the 
recording portion. 

19. (original) The metadata search device according to claim 1 8, 

wherein the control file that is output from the content information file processing 
portion is devised as a table that lists recording positions of content in the recording 
portion in accordance with a recording time of the content, and the recording position of 
the content can be searched from the time code. 

20. (original) The metadata search device according to claim 1 8, 

further comprising a dictionary database, and a keyword supply portion that 
supplies keywords related to the content into the voice recognition portion; 
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wherein the voice recognition portion can select a dictionary of a genre 
corresponding to the content from a plurality of genre-dependent dictionaries, and the 
voice recognition portion is configured to recognize those keywords with higher priority. 

2 1 . (original) The metadata search device according to claim 1 8, 
further comprising a dictionary database; 

wherein the voice recognition portion can select a dictionary of a genre 
corresponding to the content from a plurality of genre-dependent dictionaries; and 

wherein the search portion is configured to search by keywords that are chosen 
from a common dictionary used by the voice recognition portion. 
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